Supplementary Materials for “A statistical approach for identifying differential distributions in single-cell RNA-seq experiments”

ثبت نشده
چکیده

Here we present a sensitivity analysis to evaluate several computational approaches to obtain an estimate of the maximum a priori (MAP) partition in the Dirichlet process mixture model framework. Specifically, we use simulated data from mixtures of normals to compare an agglomerative greedy search algorithm [Ward Jr, 1963, Wang and Dunson, 2011], a Polya urn Gibbs sampling scheme [MacEachern, 1994, Bush and MacEachern, 1996, MacEachern and Müller, 1998], an iterative stochastic search [Shotwell, 2013], as well as a simple BIC optimization [Fraley et al., 2012]. The latter is implemented in the R package Mclust. The first three methods are implemented in the R package profdpm by Shotwell [2013]. In this sensitivity analysis we evaluate the ability of each procedure to detect the existence of multiple components when they truly exist as well as its false positive rate in detecting multiple components when there is only one. Since the expected number of components in the Dirichlet process mixture model depends on the value of α (the Dirichlet concentration parameter) as well as the sample size J [Antoniak, 1974], we vary both of these parameters. For each procedure, we evaluate each choice of J ∈ (30, 50, 100, 200) and α ∈ (0.001, 0.01, 0.05, 0.10, 1, 5) under four scenarios:

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data

Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...

متن کامل

A Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data

Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...

متن کامل

I-13: Transcriptome Dynamics of Human and Mouse Preimplantation Embryos Revealed by Single Cell RNA-Sequencing

Background: Mammalian preimplantation development is a complex process involving dramatic changes in the transcriptional architecture. However, it is still unclear about the crucial transcriptional network and key hub genes that regulate the proceeding of preimplantation embryos. Materials and Methods: Through single-cell RNAsequencing (RNA-seq) of both human and mouse preimplantation embryos, ...

متن کامل

EBSeq-HMM: a Bayesian approach for identifying gene-expression changes in ordered RNA-seq experiments

MOTIVATION With improvements in next-generation sequencing technologies and reductions in price, ordered RNA-seq experiments are becoming common. Of primary interest in these experiments is identifying genes that are changing over time or space, for example, and then characterizing the specific expression changes. A number of robust statistical methods are available to identify genes showing di...

متن کامل

Sequence analysis EBSeq-HMM: a Bayesian approach for identifying gene-expression changes in ordered RNA-seq experiments

Motivation: With improvements in next-generation sequencing technologies and reductions in price, ordered RNA-seq experiments are becoming common. Of primary interest in these experiments is identifying genes that are changing over time or space, for example, and then characterizing the specific expression changes. A number of robust statistical methods are available to identify genes showing d...

متن کامل

powsimR: power analysis for bulk and single cell RNA-seq experiments

Summary Power analysis is essential to optimize the design of RNA-seq experiments and to assess and compare the power to detect differentially expressed genes in RNA-seq data. PowsimR is a flexible tool to simulate and evaluate differential expression from bulk and especially single-cell RNA-seq data making it suitable for a priori and posterior power analyses. Availability and implementation...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016